TreeScope: Finding Structural Anomalies In Semi-Structured Data
نویسندگان
چکیده
Semi-structured data are prevalent on the web, with formats such as XML and JSON soaring in popularity due to their generality, flexibility and easy customization. However, these same features make semi-structured data prone to a range of data quality errors, from errors in content to errors in structure. While the former has been well studied, not much attention has been paid to structural errors, which can impact applications quite severely. In this demonstration, we present TREESCOPE, which analyzes semi-structured data sets with the goal of automatically identifying structural anomalies from the data. Our techniques learn robust structural models that have high support, to identify potential errors in the structure. Identified structural anomalies are then concisely summarized to provide plausible explanations of the potential errors. The goal of this demonstration is to enable an interactive exploration of the process of identifying and summarizing structural anomalies in semi-structured data sets.
منابع مشابه
Designing Good Semi-structured Databases
Semi-structured data has become prevalent with the growth of the Internet and other on-line information repositories. Many organizational databases are presented on the web as semi-structured data. Designing a \good" semi-structured database is increasingly crucial to prevent data redundancy, inconsistency and updating anomalies. In this paper, we deene a semi-structured schema graph and identi...
متن کاملDesigning Semistructured Databases: A Conceptual Approach
Semi-structured data has become prevalent with the growth of the Internet. The data is usually stored in a traditional database system or in a specialized repository. While many information providers have presented their databases on the web as semi-structured data, other information providers are developing repositories for new application. One such application is e-commerce, which is emerging...
متن کاملO-28: Detection of Fetal Major Structural Abnormalities with US in ART Patients during One Year
Background: The aims were to determine the diagnostic accuracy of ultrasound sonography in detecting major structural anomalies on all patients who conceived during a year of infertility treatment [assisted reproductive technology (ART) or non-ART treatments] at the Royan Institute, and to study the outcome of cases with nuchal translucency (NT) ≥ 95th centile in the first trimester of pregnanc...
متن کاملArchitectural approach for handling semi-structured data in a user-centred working environment
Purpose of this paper Today the amount of all kind of digital data (e.g., documents and e-mails), existing on every user’s computer, is continuously growing. Users are faced with huge difficulties when it comes to handling the existing data pool and finding specific information respectively. We aim to discover new ways of searching and finding semi-structured data by integrating semantic metadata.
متن کاملA Survey on dental anomalies among children with lip and palate clefts in Mashhad Dental School (2000)
A Survey on dental anomalies among children with lip and palate clefts in Mashhad Dental School (2000) Dr. BM. Ajami * - Dr. M. Talebi ** * - Associate Professor of Pedodontics Dept. - Faculty of Dentistry – Mashhad University of Medical Sciences. ** - Assistant Professor of Pedodontics Dept. Faculty of Dentistry – Mashhad University of Medical Sciences. Background and Aim: Children born with l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- PVLDB
دوره 8 شماره
صفحات -
تاریخ انتشار 2015